Temperature-profiled device fingerprint generation and authentication from power-up states of static cells

ABSTRACT

A method, system and computer program product for generating device fingerprints and authenticating devices uses initial states of internal storage cells after each of a number multiple power cycles for each of a number of device temperatures to generate a device fingerprint. The device fingerprint may include pairs of expected values for each of the internal storage cells and a corresponding probability that the storage cell will assume the expected value. Storage cells that have expected values varying over the multiple temperatures may be excluded from the fingerprint. A device is authenticated by a similarity algorithm that uses a match of the expected values from a known fingerprint with power-up values from an unknown device, weighting the comparisons by the probability for each cell to compute a similarity measure.

This U.S. patent application is a Continuation of U.S. patent application Ser. No. 13/483,252, filed on May 30, 2012, which is a Continuation of Ser. No. 12/146,997, filed on Jun. 26, 2008 and issued as U.S. Pat. No. 8,219,857 on Jul. 10, 2012, and claims priority thereto under 35 U.S.C. §120.

BACKGROUND OF THE INVENTION

1. Field of the Invention

The present invention is related to device identification technologies for security, general identification and protection of intellectual property (IP) in electronic systems.

2. Description of Related Art

Reliable and secure identification methods for integrated circuits are increasingly desirable, in part, to mitigate a burgeoning problem of piracy of intellectual property and related trends including unauthorized code execution. The shrinking of feature sizes of semiconductor devices by technology scaling has reached the point where process variation is becoming a limiting constraint on the fabrication process. Techniques for proving copying have included placing physical identifiers, such as microscopic logos, text, and even holograms in or on a die. Such mechanisms do not provide ready identification from electronic means. Techniques for securing electronic systems typically use non-volatile storage for storing security keys and the like for use in validating that a proper device or subsystem is authentic. However, such techniques require additional circuitry that is subject to electronic intrusion techniques that once effective, can provide for duplication of the security keys.

While perceived as a problem for reliable manufacturing, process variation can be exploited in various ways for the identification of individual devices. Existing device identification strategies have used power-up values of SRAM cells to perform device identification, and developments in the area have typically focused on coping with the unreliability of using the initial power-up state of SRAM cells as a device identification tool. In particular, when collecting state data after successive power-cycles, some of the cells of a SRAM array assume the same value after every power-cycle, while others do not. Cells with a reliable power-up value are considered stable cells and those without a reliable power-up value are considered unstable cells. Existing identification methods take the supposedly probabilistic behavior of the SRAM cell power-up state into account, typically by performing multiple power-up state evaluations to determine the expected power-up values according to the law of large numbers. In some techniques, SRAM power-up values are used for random number generation.

A SRAM fingerprint is a bit string that may be determined from multiple power-cycle SRAM state evaluations. During each evaluation, each SRAM cell assumes a power-up state value, either 0 or 1. Table I illustrates the process of determining an SRAM fingerprint for an SRAM with 6 bits in positions [b0:b5] based on 8 power-up state evaluations. In the example illustrated by Table I, Bits 1, 3, and 4 are stable in that their power-up value is the same for each evaluation. In contrast, bits 0, 2, and 5 are unstable because they assume different power-up values. A simple and typical choice for generating an SRAM fingerprint is to include the unique power-up values of the stable bits, and for the unstable bits to choose the power-up value that occurs in the majority of all experiments. If both values 0 and 1 occur equally often, either value may be used, and the value used in the fingerprint may be selected randomly. For example, value 1 may be chosen for the expected value of bit 5 in the SRAM fingerprint illustrated in Table I.

TABLE I power-up power-up values experiment 0 1 2 3 4 5 1 0 1 1 0 1 0 2 0 1 0 0 1 1 3 0 1 1 0 1 0 4 0 1 1 0 1 1 5 1 1 1 0 1 0 6 0 1 0 0 1 1 7 0 1 1 0 1 0 8 0 1 1 0 1 1 fingerprint 0 1 1 0 1 1

Once the SRAM fingerprint is obtained, for example during manufacturing testing of a device, the fingerprint can be later used in challenges that validate the unique identity of the device, by performing field evaluations that typically repeat the power-up state evaluations to match an expected fingerprint with the states assumed by an SRAM device after multiple power cycles. However, the nature of the probabilistic models assumed in the existing techniques limits the reliability of such identification/validation techniques, which can require large amounts of data and still be subject to error.

Therefore, it would be desirable to provide a method, system and computer program product for generating a reliable fingerprint from a device having static storage cells from power-up state evaluations. It would further be desirable to provide a reliable matching criteria that provides a high level of validation confidence in identifying whether or not a particular device matches a given fingerprint.

BRIEF SUMMARY OF THE INVENTION

The above objectives of generating a reliable fingerprint that provides reliable and secure device identification based upon power-up static cell values, is accomplished in a computer performed method, and workstation computer, which may be a computer-controlled test system. The method is a method of operation of the computer system, which may be at least partially embodied in a computer program product including program instructions stored in computer-readable storage media for execution in a workstation computer system.

The computer system and method receive measurement data indicative of power-up states of static storage cells in a device, which may be obtained at wafer-level or device-level testing, or alternatively from a device via an interface at any time, such as from a scan chain or bus interface, e.g., a memory device may be “read” via its primary data interface to obtain the power-up values/states. A unique device identifier is generated from the power-up state data. The states of the static storage cells are collected after multiple power cycles to form sets of power-up states. The sets are collected for multiple temperatures, and optionally multiple power supply voltages, to form a collection of sets of power-up states. The state data are reduced by computing expected values for the state of each cell, and may be further reduced by merging the data for the multiple temperature ranges. The identifier may be generated as a bit string having fields corresponding to pairs of values for each cell: 1) the expected power-up value of the cell; and 2) the probability that the power-up value of the cell will assume the expected value. Data for cells having expected values that change over temperature may be excluded from the identifier, by including static values in fields of the identifier corresponding to the cells that vary, so that those cells can be excluded from subsequent authentication.

The device can later be authenticated using a distance function that compares a single initial power-up data set for the device to the previously-computed identifier. The distance function compares the states in the single data set to the expected values in the identifier. For expected values that match the corresponding single data set value, a total is incremented according to the corresponding probability stored in the identifier. For cells that differ, the total is decremented by the probability. The total is then divided by the number of contributing cells, and the result is then compared to a threshold value to decide whether or not the device is authentic.

The foregoing and other objectives, features, and advantages of the invention will be apparent from the following, more particular, description of the preferred embodiment of the invention, as illustrated in the accompanying drawings.

BRIEF DESCRIPTION OF THE SEVERAL VIEWS OF THE DRAWING

The novel features believed characteristic of the invention are set forth in the appended claims. The invention itself, however, as well as a preferred mode of use, further objectives, and advantages thereof, will best be understood by reference to the following detailed description of the invention when read in conjunction with the accompanying Figures, wherein like reference numerals indicate like components, and:

FIG. 1A is an electrical schematic of a static storage cell that may be used to provide power-up state information used in embodiments of the present invention.

FIG. 1B is a block diagram depicting an integrated circuit that may be identified by techniques in accordance with embodiments of the present invention.

FIG. 2 is a pictorial diagram of a wafer test system in which methods in accordance with an embodiment of the present invention are performed.

FIG. 3 is a graph depicting variation of power-up state data with voltage and temperature conditions.

FIGS. 4A-4C are graphs depicting autocorrelation of bit-strings generated from representative samples of power-up state data.

FIGS. 5A and 5B are a table showing Hamming distances for device fingerprints for pairs of die within a set of 32 die.

FIG. 6 is a graph depicting unstable bit counts versus supply voltage and temperature.

FIG. 7 is a table showing Hamming distances for pairs of temperature-voltage conditions for a single die.

FIGS. 8A and 8B are Venn diagrams showing a relationship of voltage and temperature and cell stability.

FIG. 9 is a graph depicting a number of unstable bits for each of 32 die showing a number of bits having varying fingerprint values and unvarying fingerprint values over temperature and voltage variation.

FIGS. 10A and 10B are graphs depicting unstable bit behavior vs. temperature.

FIG. 11 is a diagram depicting a graph-theoretical model of device identification via fingerprint matching.

FIG. 12 is a graph depicting the behavior of an unstable bit for a sequence of power cycles.

FIG. 13 is a graph depicting the distribution of similarity measure for matching and mis-matched fingerprints.

FIG. 14 is a flowchart depicting a fingerprint generating method in accordance with an embodiment of the present invention.

FIG. 15 is a flowchart depicting a fingerprint matching method in accordance with an embodiment of the present invention.

FIGS. 16A and 16B are graphs depicting the similarity distribution for fingerprints of order 10 and 140, respectively, as generated by a method in accordance with an embodiment of the present invention.

FIG. 17A is graph depicting the similarity bound maximum and minimum versus fingerprint length and FIG. 17B is a graph depicting the similarity gap versus fingerprint length for fingerprints generated by a method in accordance with an embodiment of the present invention from data for a number of power supply voltages and environmental temperatures.

FIG. 18A is graph depicting the similarity bound maximum and minimum versus fingerprint length and FIG. 18B is a graph depicting the similarity gap versus fingerprint length for fingerprints generated by a method in accordance with an embodiment of the present invention from data for a nominal power supply voltage at two temperature extremes.

FIG. 19 is a block diagram depicting a system that may be secured by fingerprinting techniques in accordance with an embodiment of the present invention.

FIG. 20 is a block diagram depicting a system that may be secured by fingerprinting techniques in accordance with another embodiment of the present invention.

FIG. 21 is a block diagram depicting a processor that may be secured by fingerprinting techniques in accordance with an embodiment of the present invention.

DETAILED DESCRIPTION OF THE INVENTION

The present invention relates identification/authentication techniques for electronic devices. Given a sufficient number of static storage cells within a device, e.g., SRAM cells, latches, flip-flops, etc., a “fingerprint” can be generated that uniquely identifies the device from power-up initial values of the static storage cells. The signatures are generally determined at time of manufacture, as in the present invention, the fingerprint is made substantially temperature invariant by collecting power-up state data at multiple environmental temperatures for a sufficiently large number of power cycles at each of the temperatures. The fingerprint is generated from the collected data so that the device can be subsequently identified from a single power-up state set, although multiple power-up state sets can be collected to enhance the verification success rate. Temperature insensitivity of the fingerprint is ensured by excluding static storage cells for which the expected value of the power-up state varies with temperature, which can be accomplished by inserting a don't-care indication (e.g., a particular static value) in fields of the fingerprint that correspond to the cells with temperature-varying static values. Alternatively, multiple fingerprints (or a fingerprint encoding expected values for each temperature) can be provided and used in a sequential matching process that identifies a device by trying different fingerprints corresponding to different temperatures. The power supply voltage supplied to the device can also be studied and used to generate a fingerprint that is more accurate for devices having expected value variation with power supply voltage.

Referring now to FIG. 1A, an SRAM cell is shown that exemplifies static storage cells that can provide power-up state data used by embodiments of the present invention to generate a fingerprint. Transistors N1 and P1 form a first inverter and transistors N2 and P2 form a second inverter. The inverters are cross-coupled to form a static storage latch, which can then be connected to pass gates to form SRAM elements, or to other logic circuits to provide latch storage, flip-flop functionality, etc. Further, the SRAM cell illustrated in FIG. 1A is only exemplary and it is understood that the techniques of the present invention may be applied to devices having a different type (or multiple types) of static storage element. As long as the static storage elements exhibit varying power-up values on different die due to process variation, the statistical behavior of their power-up states can be used to generate a fingerprint, so that the device can later be authenticated.

Referring now to FIG. 1B, a device integrated circuit 10 is shown that may be fingerprinted and identified/authenticated by techniques according to the present invention. Integrated circuit 10 includes SRAM storage elements 14, which assume various initial states when power supply voltage is applied to integrated circuit 10. Integrated circuit 10 also includes scan logic 12 that is coupled to various functional units of integrated circuit 10, including read/write circuits 16, I/O interface 19 and row/column select (address) circuits 18. There are two mechanisms available for reading the initial states of SRAM storage elements 14. The first mechanism is via scan logic 12, and is most amenable to reading the power-up states at wafer-level test, since probing test pads and test system interfaces that connect to scan logic 12 are typically available for other purposes, including device pass/fail testing. The second mechanism is via I/O interface 19, which provides for the connection of integrated circuit 10 to other devices in a system into which integrated circuit 10 is installed. For example, integrated circuit 10 may form an SRAM device, e.g., a cache memory for use in a computer system, and I/O interface 19 may be the bus interface that connects integrated circuit 10 to a local processor bus. When I/O interface 19 is accessible, power-up state values of SRAM storage elements 14 may be read directly by a processor via I/O interface 19 to obtain the initial power-up states used to identify/authenticate integrated circuit 10. If integrated circuit 10 is the processor, and SRAM storage elements 14 form part of a cache memory, then I/O interface 19 may be an internal connection to a processor core. However, in other embodiments, scan logic 12 may also be the mechanism used to identify/authenticate integrated circuit 10, as in embodiments where the values of SRAM storage elements 10 are static latches not accessible via another interface. Also, I/O interface 19 may be the mechanism for obtaining the multiple power-up state data sets used to compute the fingerprint for integrated circuit 10, as in embodiments in which the fingerprint is obtained in the field or after singulation and encapsulation of a die on which integrated circuit 10 is formed.

Referring now to FIG. 2, a test system that may be used to obtain power-up state information from static storage cells within an integrated circuit and compute a fingerprint for the integrated circuit in accordance with techniques of the present invention is shown. A wafer tester 20 is coupled to a workstation computer system that includes a processor 26 coupled to a memory 27. Memory 27 stores program instructions and data, including program instructions for carrying out methods according to the present invention, which may be loaded into memory 27 from a fixed media such as CD-ROM CD via CD-ROM drive 25. A graphical display 29 is coupled to workstation computer system 28 for displaying results according to any of the computations described hereinafter, and to provide a user interface for input of control information to operate computer programs in accordance with embodiments of the present invention. Input devices, such as keyboard 24A and pointing device 24B are also coupled to workstation computer system 28 to provide for input of user commands. Wafer tester 20 includes a scan unit 21 that provides control information to, and receives data (including the power-up state data used by the present invention) from one or more die 22A integrated on a wafer 22 via test probe connections 23A to a test head 23. Wafer tester 20 also includes a programmable voltage supply PVS that is used to cycle power supply voltage (s) supplied to die 22A and to vary the power supply voltage in accordance with the techniques described below. A temperature chamber TC is provided to control the temperature of wafer 22 so that the power-up states of static storage elements integrated on die 22A can be studied for multiple environmental temperatures of die 22A.

The power-up behavior of SRAM cells exposes more information, when temperature dependencies are taken into account, than the probabilistic behavior assumed by previous techniques. The present invention expoits temperature variation information to provide a method for device identification that is reliable across a wide range of supply voltages and environmental temperature conditions. Below, in the Section labeled “Empirical Analysis”, experimental data are presented and analyzed that are based on more than 300,000 power-cycle experiments of 43 die with SRAM memories for 6 different supply voltages and at 6 different temperatures. The probabilistic model of fingerprints in the section below labeled “Probabilistic Fingerprint Model” provides the foundation for an embodiment of the present invention that is presented in the section below labeled “Practical Fingerprinting Method.”

Empirical Analysis

Measurements were performed on one wafer with 43 die, each with a 4,096-bit SRAM in 65 nm bulk technology. A temperature controlled probe-station was used to perform power-cycle experiments at temperatures (in degrees Centigrade):

T={25,50,75,100,125,150}.

For each temperature TεT, 200 power-cycle experiments were performed for each of supply voltages

V _(dd)={0.4V,0.6V,0.8V,1.0V,1.2V,1.4V},

by raising the power supply voltage V_(dd) rapidly to the specified value, providing an approximately 1 microsecond settling time for internal device states, and then adjusting power supply voltage V_(dd) to a nominal value of 1V before reading the bit-string of 4,096 power-up values from the static storage cells within the device. In the experimental setup, systematic errors that led to invalid test results were removed by discarding some of the data (e.g., data sets that yielded predominately one value and data sets in which the bit-strings were obviously shifted due to scan chain error). As a result, 81,045 power-up bit-strings were retained for 32 die, or a total of 1,024 data sets for a subset of (die×V_(dd)×T) rather than (die×V_(dd)×T)=32·6·6=1,152 data sets.

Distribution of Power-Up Values

Each power-up bit-string in the tests described above consisted of 4,096 binary values. If the power-up values were uniformly random distributed, an equal number of 0 values and 1 values would be expected, i.e., a count of 2048 of each value. Referring now to FIG. 3, a graph of the number of 1's is shown using a series of error-bars that bound the minimum and maximum number of 1's of the bit-strings for each (V_(dd),T)-condition. The (V_(dd),T)-conditions are sorted in ascending order of the mean number of 1's. From the Figure, it can be observed that: 1) the standard deviation of the number of 1's per (V_(dd),T)-condition is relatively small; and (2) that the average number of 1's varies substantially, by 32% of the bit-string length 4,096, across the (V_(dd),T)-conditions. Furthermore, the overall mean number of 1's is only 1,702, or 41.5% of the bit-string length, which appears to be due to process gradients.

Autocorrelation and Independence

The results for the distribution of 0's and 1's in the power-up bit-strings suggest that the power-up values might not be uniformly random distributed. We investigate the independence of the power-up events of individual bits in the bit-strings by computing autocorrelations, a standard procedure in signal processing. The autocorrelation coefficient of sample series X=[x₀, x₁, . . . , x_(n-1)] is

${{r(l)} = \frac{\sum\limits_{i = 0}^{n - 1}\; {x_{i}x_{i + {l\; {mod}\; n}}}}{\sum\limits_{i = 0}^{n - 1}\; x_{i}^{2}}},$

where l is the lag. The lag is the number of sample points by which the sample series X is cyclically shifted to the left. The autocorrelation coefficient quantifies by how much the products of pairs of sample points cancel each other. In the present analysis, X is a power-up bit-string. Thus, the x_(i) are power-up values having values {0,1}, and the autocorrelation coefficient r(l) lies in range [0,1]. Value r=1 indicates 100% autocorrelation in the same sense, and r=0 indicates 100% autocorrelation in the opposing sense, that is x_(i)≠x_(i+lmod n) for all i. Lag l plays the critical role in the computation of autocorrelations. In particular, we can use lag l to determine whether a series of sample points is independent by performing an autocorrelation for different lags. Only for lag l=0 should autocorrelation coefficient r(l) assume value 1. For all other lags 0<l<n, the autocorrelation coefficient should be near 0.5, if the 0's and 1's are uniformly distributed.

Referring now to FIGS. 4A-4C, autocorrelation coefficients for three representative sample bit-strings, chosen from the left and right boundaries of the data displayed in FIG. 2, are shown. FIG. 4A shows the autocorrelation coefficient versus lag for a sample containing with 28% of the power-up bit states having a value of “1”, and FIG. 4B and FIG. 4C show the same for samples having 42% “1” bits and 58% “1” bits, respectively. FIG. 4B shows autocorrelation behavior for data having a number of “1” bits that is close to the overall mean number of “1” bits in the analysis. For all samples, autocorrelation coefficient r(l) is equal to the ratio of the number of “1” bits in the bit-string for most lags. However, for lag values l=1, 2 4094 and 4095, autocorrelation coefficient r(l) is slightly greater, indicating that the power-up values of individual bits are not entirely independent. Instead, pairs of neighboring bits appear to affect each other's power-up values. Nevertheless, the neighboring cell dependence is relatively small, and the power-up values can generally be viewed as independent.

Hamming Distance

Another indicator of the distribution of 0's and 1's in the power-up bit-strings is the Hamming distance between two bit-strings. The Hamming distance of two bit-strings X=[x₀, x₁, . . . , x_(n-1)] and Y=[y₀, y₁, . . . , y_(n-1)] of equal length n is the number of differing bits, formally defined as:

$\begin{matrix} {{{HD}\left( {X,Y} \right)} = {\sum\limits_{i = 0}^{n - 1}\; {\left\lbrack {x_{i} \neq y_{i}} \right\rbrack.}}} & (1) \end{matrix}$

using Iverson's APL notation for predicate A:

$\lbrack A\rbrack = \left\{ \begin{matrix} 1 & {{{if}\mspace{14mu} A\mspace{14mu} {is}\mspace{14mu} {true}},} \\ 0 & {{otherwise}.} \end{matrix} \right.$

FIGS. 5A and 5B present a table that lists the Hamming distances between the SRAM fingerprints for all pairs of die. For each die, an SRAM fingerprint was computed as described above by including the bit-strings of all power-cycle tests for all (V_(dd),T)-conditions. If the power-up state values of the static storage cells were uniformly distributed, a Hamming distance of HD(X,Y)=n/2, would be expected, because half of the possible events (x₁,y₁)ε{(0,0), (0,1), (1,0), (1,1)} have differing bits. The results shown in the table of FIGS. 5A and 5B for n=4,096 indicate that the actual Hamming distances are relatively close to the ideal value of 2,048, with an overall mean of mean (HD)=1979.4. More importantly, the tables show that SRAM fingerprints of length n=4,096 do clearly identify each of the 32 die included in the tests by means of Hamming distances.

Voltage and Temperature

Referring now to FIG. 6, results from power-cycle experiments for one die based on a total of 2,097 bit-strings, or about 58 power-up bit-strings (at least 49) per (V_(dd),T)-condition. Each bit-string contains 4,096 bits. FIG. 6 shows the number of unstable cells for each set of power-up measurements per (V_(dd),T)-condition. (For each supply voltage condition, the bars are ordered from the lowest temperature condition to the highest from left to right.) Note that cells may be stable within multiple (V_(dd),T)-conditions, but have different power-up values across (V_(dd),T)-conditions. FIG. 6 does not show this phenomenon, which is discussed in further detail below. From FIG. 6, two observations are made:

1. The power-up state values depend on temperature, where an exponential decrease in the number of unstable cells is observed with increasing temperature.

2. The power-up state values are substantially independent of applied supply voltage. (As a second order effect, a slight increase in the number of unstable cells is seen with increasing voltage at a given temperature.)

Referring now to FIG. 7, a different perspective on the data shown in FIG. 6 is presented in a table. The table of FIG. 7 shows Hamming distances of SRAM fingerprints that have been derived for each (V_(dd),T)-condition. The dominance of the temperature dependence is also visible in the table. For each voltage, the Hamming distance increases with increasing temperature. Comparatively, the Hamming distance increases only modestly for increasing voltage at the same temperature. Additional information about the voltage and temperature dependence of the power-up values can be gained from plotting a Venn diagram of the unstable cells. FIGS. 8A and 8B show the cardinalities of subsets of cells that are unstable under certain voltage and temperature conditions. For the benefit of clarity, in each Figure, only a subset of three different voltages or temperatures are shown. FIG. 7A shows that 779 cells are unstable at T=25, another 120 cells are unstable at both temperatures T=25 and T=100, and other counts as shown in the remainder of FIG. 7A. It is observed that only 14 cells are unstable at all three temperatures, indicating that different temperatures effect the stability of different sets of cells. In contrast, for the voltage dependence Venn diagram shown in FIG. 7B, most of the unstable cells (778) are unstable at all voltages. Only a small fraction is unstable for any one particular voltage, such as the count of 50 cells shown for V=0.4V. Therefore, to the first order, power-up values depend on the temperature, but not on supply voltage, and a substantial portion of the memory cells, up to about 20%, become unstable at lower temperatures. Furthermore, different cells become unstable at different temperatures.

Temperature Dependence of Fingerprints

The temperature dependence of SRAM fingerprints, in particular the exponential behavior exhibited in FIG. 6 is likely due to a fundamental property of the SRAM circuit of a memory cell. Therefore it is desirable to determine how much of the instability of power-up values observed in the above-described tests is due to noise, meta-stability, or statistical effects due to process variation, as opposed to deterministic effects caused by operating temperature. On an empirical basis, if the fingerprints for each (V_(dd),T)-condition of a die are considered, as shown in the table of FIGS. 7A-7B, the contribution of environmental noise can be evaluated. A fingerprint is computed from the set of bit-strings measured at a particular (V_(dd),T)-condition. At one (V_(dd),T)-condition, some of the cells are unstable, which may be attributed to non-deterministic effects. All stable cells assume a unique power-up value, which cannot be attributed to non-deterministic effects. For a different (V_(dd),T)-condition, the same cells may be stable as well, but can assume a different power-up value. Without distinguishing the (V_(dd),T)-conditions, the cells assuming a difference power-up value would be considered unstable. Referring now to FIG. 9, the number of unstable cells is shown for each of the 32 die, and are separated into those with different fingerprint values under different (V_(dd),T)-conditions (upper portion of bars) and those where the fingerprint value is the same for all (V_(dd),T)-conditions (lower portion of bars). It is observed from FIG. 9, that, on average, only about 23% of all unstable cells yield the same fingerprint value for each (V_(dd),T)-condition. Most of the cells yield different fingerprint values for different (V_(dd),T)-conditions, and, it appears that those cells are stable for individual (V_(dd),T)-conditions. The above results indicate that environmental noise is a minor contributor to the instability of power-up values.

Closer examination of the fingerprint bit values of individual cells shows that unstable cells exhibit a distinguished temperature dependence. Either, the fingerprint bit value is “0” for low temperatures and “1” for high temperatures, or vice versa. Referring to FIGS. 10A and 10B, the behavior of individual cells is shown, indicating a crossover temperature, below and above of which the cell assumes a different power-up value. FIG. 10A illustrates the behavior of a cell having a nominal (expected) power-up value of “1” for temperature conditions below 112° C. and a nominal power-up value of “0” above 112° C. Similarly, FIG. 10B illustrates the behavior of a cell having a nominal power-up value of “0” for temperature conditions below 62° C. and a nominal power-up value of “0” above 62° C. Since the power-up behavior of the unstable cells is not deterministic, FIGS. 10A and 10B are intended to illustrate a probabilistic (expected value) behavior and not a consistent behavior of the exemplary cells. Analysis of power-up data reveals that 90% of all unstable cells with varying fingerprint value exhibit the temperature dependence illustrated in FIGS. 10A and 10B. The remaining 10% toggle their fingerprint value more than once within the temperature range. A SRAM cell is generally a cross-coupled amplifier as illustrated in FIG. 1A, which exhibits positive feedback and regenerates the smallest of differential voltages. At the initial application of power supply voltage, the complementary outputs of an ideally symmetric cell will both rise to V_(dd)/2, which is the high-gain region, and the regeneration process will begin. An ideally symmetric SRAM cell maintains the meta-stable state where the voltages remain at V_(dd)/2. Otherwise, the slightest voltage difference will be amplified and the cell latches one of the two possible power-up values. A voltage difference can arise from a number of factors which can be classified into four categories:

(1) Environmental effects including supply voltage and temperature. Because the cell is differential the effects for this category are generally negligible in the fingerprinting result. (2) Process variation, which affects threshold voltage, mobility, contact resistance, etc. The effects of process variation will bias the cell in one direction indefinitely. Strong shifts in process variation are responsible for stable bits. (3) Random noise. Random events such as thermal noise, flicker noise, etc. can appear in the high gain region and trigger the cell to regenerate. Noise events are responsible for highly unstable bits. (4) Environmental effects on process variation. The effects of process variation are highly dependent on temperature and voltage. Hence, the bias due to process variation can be weaker or stronger depending on the environmental conditions. For example, the gain of a transistor depends on the threshold voltage and the mobility. The threshold voltage depends nonlinearly on temperature and linearly on mobility. Consequently, a cell's regeneration polarity can change as a function of temperature, causing the cell to be unstable. Furthermore, a cell can be stable for two different temperatures but assume complementary power-up values.

Temperature dependency of the effects of process variation on the electrical power-up behavior of a SRAM cell can be analyzed in detail as follows. Assuming a perfectly symmetric SRAM cell except for asymmetries in the two pull-down n-FETs (e.g., transistors N1 and N2 in FIG. 1A) due to process variation, the effect of temperature on process variation can be analyzed. A further assumption is that there exists a voltage and temperature where both n-FETs drive equal currents, causing the cell to become meta-stable when powering up. Below are the equations for the three relevant quantities for long-channel devices. (Short-channel devices can be treated by an analogous analysis). The drain current is:

I _(DS)∝(V _(GS) −V _(th))²,

where V_(GS) is the voltage from gate to source, which is assumed to be temperature independent. The threshold voltage is

V _(th)(T)=V _(th)(T ₀)−κ(T−T ₀),

where T₀ is 300 K, κ is the temperature coefficient of the threshold voltage (typical κ=2.4 m V/K), and T is temperature. The mobility is

${\mu (T)} \propto {{\mu \left( T_{0} \right)}{\frac{T}{T_{0}}.}}$

Substituting the expressions for V_(th)(T) and μ(T) gives us an expression for the drain current

$\begin{matrix} {{I_{DS} \propto {\frac{\mu}{T}\left( {v + {\kappa \; T}} \right)^{2}}},{{where}\mspace{14mu} \mu},v,{\kappa \mspace{14mu} {are}\mspace{14mu} {{constant}.}}} & (2) \end{matrix}$

Assuming that constant κ is not affected by process variation but that constants κ and ν are, for each of the two n-FETs, we have constants μ₁, ν₁ and μ₂, ν₂, and μ₁≠μ₂ and ν₁≠ν₂ in general. By definition, the drain currents through the n-FETs of a SRAM cell are the same in the meta-stable state. Hence, the following condition holds:

$\begin{matrix} {{\frac{\mu_{1}}{T}\left( {v_{1} + {\kappa \; T}} \right)^{2}} = {\left. {\frac{\mu_{2}}{T}\left( {v_{2} + {\kappa \; T}} \right)^{2}}\Leftrightarrow\frac{\mu_{1}}{\mu_{2}} \right. = {\left( \frac{v_{2} + {\kappa \; T}}{v_{1} + {\kappa \; T}} \right)^{2}.}}} & (3) \end{matrix}$

Therefore, it can concluded that in the meta-stable state at temperature T due to process variation, if ν₁<μ₂ then ν₁>ν₂, and if μ₁>μ₂ then ν₁<ν₂.

Assuming the same SRAM cell is operated at temperature T′≠T, according to Equation 2 above, the currents through n-FET 1 and n-FET 2 are I_(DS,1)′∝μ₁/T′(ν₁+κT′)² and I_(DS,2)′∝μ₂/T′(ν₂+κT′)². Substituting μ₁ according to Equation 3 above, the following relation results:

$\begin{matrix} {{I_{{DS},1}^{\prime} \propto {\frac{\mu_{2}}{T^{\prime}}\left( \frac{v_{2} + {\kappa \; T}}{v_{1} + {\kappa \; T}} \right)^{2}\left( {v_{1} + {\kappa \; T^{\prime}}} \right)^{2}}} = {{I_{{DS},2}^{\prime}\left( {\frac{v_{2} + {\kappa \; T}}{v_{1} + {\kappa \; T}} \cdot \frac{v_{1} + {\kappa \; T^{\prime}}}{v_{2} + {\kappa \; T^{\prime}}}} \right)}^{2}.}} & (4) \end{matrix}$

Introducing name α in the ratio I_(DS,1)′/I_(DS,2)′=α², the circuit structure imposes α>0. In addition, introducing voltage shift ν_(s) such that ν₂=ν₁+ν_(s), the following relations result:

$\begin{matrix} \left. {\alpha > 1}\Leftrightarrow{\frac{v_{2} + {\kappa \; T}}{v_{2} + {\kappa \; T^{\prime}}} > \frac{v_{1} + {\kappa \; T}}{v_{1} + {\kappa \; T^{\prime}}}}\Leftrightarrow{\frac{v_{1} + {\kappa \; T} + v_{s}}{v_{1} + {\kappa \; T^{\prime}} + v_{s}} > \frac{v_{1} + {\kappa \; T}}{v_{1} + {\kappa \; T^{\prime}}}}\Leftrightarrow{{v_{s}\left( {v_{1} + {\kappa \; T^{\prime}}} \right)} > {{v_{s}\left( {v_{1} + {\kappa \; T}} \right)}.}} \right. & (5) \end{matrix}$

Since ν_(s)>0 implies ν₂>ν₁, it can be concluded from Equation 5 that ν₂>ν₁

T′>T and ν₂<ν₁

T′<T. The converse results are obtained for α<1. Furthermore, α>1 implies I_(DS,1)′>I_(DS,2)′, and the following relations are obtained:

I _(DS,1) ′>I _(DS,2)′ if T′>T,  (6)

I _(DS,1) ′>I _(DS,2)′ if T′>T.  (7)

For ν₂<ν₁ the converse relations of those for ν₂>ν₁ above hold. Relations (6) and (7), above, can be interpreted such that for temperature T′, one of the n-FETs drives a stronger current than the other. The imbalance in current causes the feedback loop to latch a stable state. Therefore, there exists just one inflection point at meta-stability temperature T. If the temperature is T′<T the cell will latch one state, and for T′>T the cell latches the complementary state. The above analysis explains the behavior illustrated in FIGS. 10A and 10B. Since the amplification of a SRAM cell is significant, the sensitivity of the crossover temperature is very high, and can be approximated by a step function. Thus, the observed random behavior of power-up values can be reduced primarily to process variation. A SRAM cell is either stable if process variation introduces a strong bias in the drive strengths, or it causes the step-function behavior illustrated in FIGS. 10A and 10B, in which the crossover temperature depends on the process variation.

Probabilistic Fingerprint Model

Elementary probability theory can be used to model the fingerprints of a SRAM. In particular, a probabilistic similarity measure can be introduced and a lower bound on the length of fingerprints can be determined therefrom.

Graph-Theoretical Problem Description

The problem of identifying a chip by means of a SRAM fingerprint can be represented abstractly using a graph, as shown in FIG. 11. Assume we have three entities (particular die in the example) X, Y, and Z with fingerprint samples X₁, X₂, X₃, Y₁, Y₂, etc. It is desirable to generate a representative fingerprint for each entity, such as FX=fingerprint(X) for entity X, that can identify the die. Fingerprints generated according to existing techniques that do not consider temperature dependence provide such a representative fingerprint, but more apt choices are possible, an example of which is described below. To identify entity X using its corresponding fingerprint X₂ fingerprint X₂ is compared with representative fingerprints FX FY, and FZ, to obtain a positive identification for entity X and negative identifications for entities Y and Z. A graph G=(V,E), illustrated by FIG. 11, consists of the set of vertices V comprising the sample and representative fingerprints. Restricting the analysis to fingerprints in form of bit-strings and finite numbers of entities and samples, the set of edges E represents the fingerprint matching function M such that edge (u,v) between vertices u and v exists if M(u,v) detects a match between the associated fingerprints. Matching function M(u,v) shall detect matches and mismatches such that the following edges exist or do not exist, respectively:

(X_(i),FX) exists: sample X_(i) matches fingerprint FX

(X_(i),FY) does not exist: sample X_(i) mismatches fingerprint FY

If these two types of matches and mismatches were the only relations between fingerprints and samples, the graph would be a “forest” of star-subgraphs with a representative fingerprint in the center of each star. Each sample fingerprint would be associated uniquely with its representative fingerprint by the solid lines in FIG. 11. If edge (X_(i),FX) does not exist, such as (Y₂,FY) in FIG. 11, a false negative is present that does not identify a legitimate sample X_(i) with its representative fingerprint FX. To the contrary, a false positive identifies a sample with the wrong representative fingerprint, e.g., edges (Y₂, FZ) and (Z₁, FY). Note that false positives of different types occur in FIG. 11. Sample Z₁ matches both representative fingerprints FZ and FY. In contrast, sample Y₂ matches FZ only. In the former case, the false positive can be detected, in the latter case it cannot. The following list summarizes the different types of edges beyond matches and mismatches:

-   -   (X_(i),FX)does not exist: false negative.     -   (X_(i),FY) exists: false positive.     -   (X_(i),X_(j)) exists: sample X_(i) matches sample X_(j), e.g.         edges (X₁, X₂), (X₂≦X₃),     -   (X₃,X₁) in FIG. 11; match considered normal.     -   (X_(i),Y_(j)) exists: sample X_(i) matches sample Y_(j), cf.         edge (X₃,Y₁) in FIG. 11; match indicates that fingerprinting         method may be incapable of identification.     -   (FX,FY) exists: representative fingerprint FX matches         representative fingerprint FY, cf. edge (FY,FZ) in FIG. 11;         representative fingerprints do not identify associated entities         uniquely.         The goal of the design of an identification scheme based on         fingerprints is to generate representative fingerprints together         with a matching function such that none of the cases above         except edges (X_(i),X_(j)) will occur. In a probabilistic         version of the identification scheme, it is desirable to         maximize the probability of matching and mismatching samples         with representative fingerprints and minimize the probability of         the occurrence of the undesirable cases above.

Power-Up Values of an SRAM Cell

When power is initially applied to an SRAM (or other device containing static storage cells), each 1-bit cell assumes a binary value, the power-up value xε{0,1}. Subscript i is used in the following to refer to the value of a cell of index i as x_(i). A cell is considered stable if it assumes the same value after each power cycle, and unstable if its value changes. It can be assumed that every memory cell is eventually unstable, that is over a large number of power cycles different power-up values will be obtained. However, for practical purposes, the observation period is limited, and therefore some cells will appear to be stable. FIG. 12 illustrates the power-up value of an unstable cell during a series of power cycles. The mean x _(i) of power-up values x_(i)(k) of unstable cell i over m power cycles is:

$\begin{matrix} {{{\overset{\_}{x}}_{i} = {\frac{1}{m}{\sum\limits_{k = 1}^{m}\; {x_{i}(k)}}}},} & (8) \end{matrix}$

in which k denotes the power cycle. For example, the mean of the power-up values x(k) in FIG. 12 over m=18 power cycles is x= 6/18. The expected power-up value E(x_(i)) of unstable cell i by thresholding can be defined as:

$\begin{matrix} {{E\left( x_{i} \right)} = \left\{ \begin{matrix} {0,} & {{{if}\mspace{14mu} {\overset{\_}{x}}_{i}} < 0.5} \\ {1,} & {{{if}\mspace{14mu} {\overset{\_}{x}}_{i}} \geq {0.5.}} \end{matrix} \right.} & (9) \end{matrix}$

Based on 18 power cycles, the expected power-up value of the cell exemplified by FIG. 12 is E(x)=0. Next, the outcome of a power-cycle test, the power-up event Q(x_(i)) of cell i, can be defined as a random variable:

$\begin{matrix} {{Q\left( x_{i} \right)} = \left\{ \begin{matrix} {1,} & {{{if}\mspace{14mu} x_{i}} = {E\left( x_{i} \right)}} \\ {0,} & {{{if}\mspace{14mu} x_{i}} \neq {{E\left( x_{i} \right)}.}} \end{matrix} \right.} & (10) \end{matrix}$

Cell i assumes the expected power-up value if the outcome of a power-up experiment results in Q(x_(i))=1, and cell i assumes the unexpected power-up value if Q(x_(i))=0. For the graph depicted in FIG. 12, the unexpected power-up value is given by Q(x(k))=0 during power cycles kε{3, 5, 6, 7, 10, 16}, and the expected power-up value is given by Q(x(k))=1, for the remaining power cycles. The sample mean of m power-up events

$\begin{matrix} {{\overset{\_}{Q}\left( x_{i} \right)} = {\frac{1}{m}{\sum\limits_{k = 1}^{m}\; {Q\left( {x_{i}(k)} \right)}}}} & (11) \end{matrix}$

characterizes the stability of the power-up values of a memory cell. If Q(x_(i))=1, then memory cell i is stable. On the other hand, if Q(x_(i))=½ (for even m), then the cell at position i is “maximally” unstable, in other words, the power-up value of the cell toggles either from 0 to 1 or vice versa during each power cycle. For half of the power cycles (every other cycle) the power-up value equals the expected power-up value, and during the other half they differ. In fact, Q(x_(i)) lies in range [½, 1]. In order to denote the probability that cell i assumes its expected power-up value, the sample mean is scaled to cover range [0,1] according to:

q(x _(i))=2 Q( x _(i))−1.  (12)

Thus, if cell i is stable q(x_(i))=1, and if cell i is “maximally” unstable q(x_(i))=0. The probability that cell i assumes its expected power-up value is therefore 0. In other words, q(x_(i))=0 indicates that cell i has an indeterminate expected power-up value.

According to the law of large numbers, a sample mean approaches the true mean for large numbers of power cycles. Thus, by gathering power-up values over a sufficiently large number of power cycles, the probability of the expected power-up event can be determined empirically. In practice, 20 samples are often considered sufficient. For example, for the fingerprint in Table I in the Background, there are m=8 samples from which the expected power-up values are computed, and the probabilities that each bit will assume its expected power-up value are given as follows:

TABLE II i 0 1 2 3 4 5 E (x_(i)) 0 1 1 0 1 1 q (x_(i)) ¾ 1 ½ 1 1 0

Similarity Measure

The present invention uses a new similarity measure for fingerprints having probabilistic bit-strings. One benefit of the similarity measure of the present invention is that it permits smaller fingerprint lengths for chip identification than when using the Hamming distance. The distance D(X,Y) between two fingerprints X=[x₀, x₁, . . . , x_(n-1)] and Y=[y₀, y₁, . . . , y_(n-1)], both of length n, and probabilities q(x_(i)) and q(y_(i)) for 0<n given by Equation 12 above can be defined as:

$\begin{matrix} {{D\left( {X,Y} \right)} = {{\sum\limits_{i = 0}^{n - 1}\; {\left\lbrack {x_{i} \neq y_{i}} \right\rbrack {q\left( x_{i} \right)}{q\left( y_{i} \right)}}} - {\sum\limits_{i = 0}^{n - 1}\; {\left\lbrack {x_{i} = y_{i}} \right\rbrack {q\left( x_{i} \right)}{q\left( y_{i} \right)}}} + {n.}}} & (13) \end{matrix}$

The distance accounts for both mismatches and matches, but with opposite signs. It is noted that

D(X,Y)=2HD(X,Y) iff ∀i:q(x _(i))=q(y _(i))=1.

Since the Hamming distance HD(X,Y), is a metric, distance measure D(X,Y) is a metric if the probabilities of all cells assuming their expected power-up values q(x_(i)) and q(y_(i)) are 1. Metric spaces are useful constructs, because they permit efficient searching, among other benefits. Defining the similarity S(X,Y) between fingerprints X and Y, both of length n, as

$\begin{matrix} {{{S\left( {X,Y} \right)} = {1 - \frac{D\left( {X,Y} \right)}{n}}},} & (14) \end{matrix}$

the similarity is bounded by −1≦S(X,Y)≦1, because the distance is bounded by 0≦D(X,Y)≦2n. Furthermore,

${S\left( {X,Y} \right)} = \left\{ \begin{matrix} 1 & {{{{if}\mspace{14mu} X} = {{Y\mspace{14mu} {and}\mspace{14mu} {q\left( x_{i} \right)}} = {{q\left( y_{i} \right)} = {1\mspace{14mu} {for}\mspace{14mu} {all}\mspace{14mu} i}}}},} \\ 0 & \begin{matrix} {{{{if}\mspace{14mu} {{HD}\left( {X,Y} \right)}} = {{\frac{1}{2}\mspace{14mu} {and}\mspace{14mu} {q\left( x_{i} \right)}} = {{q\left( y_{i} \right)} = {1\mspace{14mu} {for}\mspace{14mu} {all}\mspace{14mu} i}}}},} \\ {{{{or}\mspace{14mu} {if}\mspace{14mu} {\forall{i:{q\left( x_{i} \right)}}}} = {{0\bigvee{q\left( y_{i} \right)}} = 0}},} \end{matrix} \\ {- 1} & {{{if}\mspace{14mu} X} = {{{Y\mspace{14mu} {and}\mspace{14mu} {q\left( x_{i} \right)}}} = {{q\left( y_{i} \right)} = {1\mspace{14mu} {for}\mspace{14mu} {all}\mspace{14mu} {i.}}}}} \end{matrix} \right.$

Hence, S(X,Y) can be interpreted under the assumption that q(x_(i))=q(y_(i))=1, such that equal fingerprints have similarity S=1, complementary strings with x_(i)≠y_(i) for all i have similarity S=−1, and strings with Hamming distance HD(X,Y)=½ have similarity S=0. In addition, the similarity measure enables comparison of fingerprints when the probabilities are less than 1. In particular, if q(x_(i))=0 or q(y_(i))=0, that is at least one of x_(i) or y_(i) has an indeterminate expected power-up value, then the similarity is S=0. Thus, S=0 indicates that two strings have Hamming distance HD(X,Y)=½ with power-up probabilities 1 or the power-up probabilities are 0 for at least one of the bits, yielding no information about the associated power-up value. For the purposes of fingerprint identification, there is no requirement to distinguish these two cases when the goal is to identify matches and mismatches independent of whether the mismatch is due to mismatching fingerprints with high power-up probabilities or whether the power-up probabilities are too low to permit identification.

Device Identification

The identification of a particular electronic device having static storage cells can be carried out by the similarity measure defined in Equation 14 above. Assuming that the expected power-up bit-string E(X) is used as the representative fingerprint FX=E(X), to identify a sample bit-string X with a chip with fingerprint FX, and to distinguish it from all other chips with distinct representative fingerprints, two conditions must be fulfilled for a sample bit-string X:

${S\left( {X,{FX}} \right)}\left\{ \begin{matrix} {{\approx 1},} & {{{{if}\mspace{14mu} {FX}} = {{fingerprint}(X)}},} \\ {{< 1},} & {{{if}\mspace{14mu} {FX}} \neq {{{fingerprint}(X)}.}} \end{matrix} \right.$

The matching condition given above demands that the similarity of sample X compared to its representative fingerprint FX is much closer to 1 than to 0, that is that the bit-strings are equal with a relatively small number of exceptions. Therefore, it is necessary to know the power-up values of a majority of cells with high confidence, that is q(x_(i))≈1. In contrast, the mismatching condition demands that the similarity of sample X compared to all other representative fingerprints FY in the sample space is closer to 0 than to 1. This result should be a consequence of the fact that the Hamming distance between the sample and the representative fingerprints of all other chips Y is HD(X,FY)≈n/2 with reasonably high probability. In practice, a similarity threshold T_(S) is chosen that determines the quality of the identification. For example, if threshold T_(S)=½ is chosen, then S(X,FY)≧½ is interpreted as a positive identification, and therefore X is a sample string from a device having representative fingerprint FY=FX·S(X,FY)<½ is then interpreted as a negative identification. Besides these two distinct identification decisions, there are two possible erroneous identification outcomes. If X is a sample string from a chip with representative fingerprint FX, and FY is the representative fingerprint from a different chip, the following decisions can be made, according to Table III.

TABLE III MATCH FALSE POSITIVE S(X, FX) ≧ T_(S) S(X, FY) ≧ T_(S) FALSE NEGATIVE MISMATCH S(X, FX) < T_(S) S(X, FY) < T_(S) In the table, a match indicates that sample X correctly identifies the associated chip, and a mismatch indicates that the chip with fingerprint FY is not associated with sample X. A false positive incorrectly associates sample X with a chip with fingerprint FY, and a false negative incorrectly fails to associate sample X with the chip with fingerprint FX. A similarity measure is a reasonable means for chip identification only if the number of false positives and negatives is negligible, and the matches and mismatches identify chips correctly.

FIG. 13 illustrates the effectiveness of the above-described similarity measure according to a pair of distributions for match and mismatch conditions, in which:

S ₁=min(S _(mismatch))≦S _(mismatch)≦max(S _(mismatch))=S ₃ and

S ₂=min(S _(match))≦S _(match)≦max(S _(match))=S ₄.

The similarity measure of sample fingerprints and representative fingerprints follows a distribution similar to the one illustrated in FIG. 13. Sample fingerprints X from a chip associated with representative fingerprint FX match this fingerprint, and exhibit a larger similarity value S(X,FX) than those for other chips with representative fingerprint FY and mismatching similarity S(X,FY). Due to the probabilistic nature of the fingerprints, the similarity values can be characterized by distributions. In general, the similarity distributions of the matching and mismatching fingerprints overlap. For our similarity measure to always permit correct chip identification, we need to have max(S_(mismatch))<min(S_(match)). Then, we can determine the threshold T_(S), for example, to be the mean T_(S)=(max(S_(mismatch))+min(S_(match)))/2.

Birthday Paradox and Fingerprint Length

The quality of the chip identification scheme is directly related to the string length n of the fingerprints. In fact, a lower bound for the number of bits n can be derived from the well-known birthday paradox. The simple analysis of the birthday paradox introduces a random variable X_(ij) which assumes value 1 if two people i and j have the same birthday in a year of D days. Then the expectation of two people having the same birthday is E(X_(ij))=1/D, and the expectation that two out of N people share one birthday is N(N−1)/(2D). Consequently, if the number of people is approximately N≧√{square root over (2D)}, we expect two people to share the same birthday. From our perspective of chip identification we are interested in the converse case. Assuming we have N chips with associated fingerprints, the size of the space of fingerprints such that no two fingerprints are the same can be determined. If a fingerprint is a bit-string of length n, there exist 2^(n) distinct fingerprints. Thus, according to the birthday paradox, no two fingerprints out of N are expected to be the same if N<√{square root over (2^(n+1))}, or the number of bits in a fingerprint should be n>2 ln N−1. The table below shows the (rounded) minimum expected string length n of a fingerprint for various numbers of chips N.

Chips N 10³ 10⁶ 10⁹ 10¹² Fingerprint 20 40 60 80 length n (birthday paradox) Fingerprint 200 400 600 800 length (recommended) While analysis of the birthday paradox is based on the expectation of one shared birthday, it is of interest to minimize the probability of sharing a fingerprint. Therefore, the number of bits in a fingerprint should be required to be n>>2 ln N−1 to increase the quality of our identification scheme. Our empirical results below suggest that about ten times as many bits, that is about 20 lg N bits, constitute a reasonable choice. Fingerprint lengths of hundreds of bits can be considered practical, given that typical numbers of SRAM cells on todays' chips exceed this count comfortably.

Practical Fingerprinting Method

A fingerprinting method in accordance with an embodiment of the present invention is described below, which is robust across a wide range of supply voltages and temperatures. Given the assumption that over a large number of power cycles every SRAM cell will be unstable and in particular, that under different supply voltages and temperatures, every cell can assume a power-up value different from its expected power-up value, the fingerprinting technique of the present invention can be made more reliable. Each static storage cell is associated with the pair of the expected power-up value and the probability of assuming this value (E,q) as defined in Equations 9 and 12 above. To account for the temperature dependence, the fingerprint generation technique in accordance with an embodiment of the present invention described below includes refinements as follows. A device identifier (CID) can be defined as follows: For n SRAM cells and m₀ power-cycle experiments at environmental conditions (V_(dd),T₀) and m₁ power-cycle tests at (V_(dd),T₁), an identifier (CID) can be determined as the sequence of pairs CID=[E(x₀),q(x₀)),(E(x₁),q(x₁), . . . , (E(x_(n-1)),q(x_(n-1)))],such that

$\left( {{E\left( x_{i} \right)},{q\left( x_{i} \right)}} \right) = \left\{ \begin{matrix} \left( {0,0} \right) & {{{{if}\mspace{14mu} {E\left( {x_{i},T_{0}} \right)}} \neq {E\left( {x_{i},T_{1}} \right)}},} \\ \left( {{E\left( x_{i} \right)},\frac{{m_{0}{q\left( {x_{i},T_{0}} \right)}} + {m_{1}{q\left( {x_{i},T_{1}} \right)}}}{m_{0} + m_{1}}} \right) & {{otherwise},} \end{matrix} \right.$

where T₀ is a lower temperature, such as 25° C., T₁ is a higher temperature, such as T₁≈150° C. and V_(dd) is the nominal supply voltage. The characterization identifier given above excludes those unstable cells having power-up values that exhibit the temperature dependence shown in FIGS. 10A and 10B by asserting probability q(x_(i))=0 for those cells, thereby inserting a static value that excludes the cells from later considerations of device similarity. According to the power-cycle tests described above, the number of these cells is about 20% of the total number of cells. The exclusion of the temperature dependent cells is compensated-for by increasing the length n of the CID, as described in further detail below. The characterization identifier (CID) serves as the representative fingerprint of a device. To identify a device, a single sample fingerprint of a power-cycle test can be obtained, referred to as the field identifier (FID), and matched against a repository of CID's of all manufactured chips. Since the FID consists of a single sample, it is assumed that q(x_(i))=1 for all i, such that FID=[(x₀,1), (x₁,1), . . . , (x_(n-1),1)]. The matching operation consists of computing for each CID the similarity measure S(FID, CID) of Equation 14 above, and makes the decision whether the FID matches the CID by evaluating the predicate S(FID,CID)≧T_(S) for a similarity threshold T_(S). If the predicate is true, a match is found. The same procedure may be used to authenticate a device by comparing its FID against a single CID. For more sophisticated applications an encrypted exchange of the FID or multiple challenge-response pairs of randomly selected bits for the FID can be employed.

The threshold T_(S) for the identification procedure depends on the length n of the fingerprints. Length n is chosen based on the number of chips N that are to be distinguished and the stability of the power-up values, which is determined by the variation in the fabrication process. In the present embodiment T_(S) is selected by sub-sampling the results of the power-cycle experiments used to determine the CID's of a number of chips. Each sub-sample selects, at random, a number of n bits out of n_(cells) memory cells, n≦n_(cells). Then, the associated power-up values are used as an n-bit FID, and the similarity measure is computed against the corresponding sample subset of bits of all CID's. The fingerprint length n is large enough if for all samples max(S_(mismatch))<min(S_(match))·T_(S) can then be chosen as the mean T_(S)=(max(S_(mismatch))+min (S_(match)))/2 as described above. The central limit theorem guarantees that with increasing fingerprint length n, the similarity distributions will be closer to their expected values. Therefore, max(S_(mismatch)) will become smaller and min(S_(match)) 1 will become larger with increasing n, and a gap for a valid T_(S) should emerge.

Characterization

A method of device characterization in accordance with an embodiment of the present invention is depicted in FIG. 14. First, the device is stabilized at a particular (V_(dd),T) condition (step 70) and the initial states of the static storage cells in the device are read (step 71). As noted above, and as justified by results of tests performed on 32 experimental die as will be described in further detail below, only two temperature conditions at a fixed (nominal) voltage condition are needed to provide a CID. However, the present invention contemplates CIDs computed from any number of (V_(dd),T) conditions, and for some circuit designs and processes there may be an advantage to including additional (V_(dd),T) above and beyond tests at nominal voltage and at the extremes of the operating temperature range for a device. Steps 70 and 71 are repeated for multiple power-cycles (e.g., m>20) at each (V_(dd),T) condition to be tested by removing the applied power supply voltage and restoring it to cycle power (step 73) until the last power cycle for the current (V_(dd),T) has been performed (decision 72). Until tests for the last (V_(dd),T) condition have been completed (decision 74), steps 70-73 are repeated multiple times for each (V_(dd),T) condition. Then, the expected values of the cells are computed at each temperature (step 75) and the cells that have expected values that change with temperature are excluded (step 76). The probabilities that the values of the cells will assume their corresponding expected values are computed (step 77) and the CID is constructed from pairs of the expected value and the probability for each cell (step 78). The tests should generally be performed at two temperatures using a nominal power supply voltage. The temperatures used should generally be the lower and upper bound of the recommended operating range of the device, e.g. at 25 C and 150 C, as noted in the section above entitled “Temperature Dependence of Fingerprints.” Additional temperatures can be included to improve the quality of the CID, but the empirical results described above indicate that additional temperature tests are generally not needed and generally do not improve the similarity gap for the process and design that were tested, which are expected to be representative of SRAM devices. The method can therefore be performed as follows: for each device, perform m>20 power-cycle experiments to gather power-up values of all n_(cells) memory cells for (V_(dd),T₀) and (V_(dd),T₁), where V_(dd) is the nominal supply voltage, T₀ is a low temperature near the lower bound of the manufacturer's recommended operating range, and T₁ is a high temperature near the upper bound of the manufacturer's operating range. From the method depicted in FIG. 14, a repository of CID's can be compiled for subsequent similarity matching. The flowchart of FIG. 14 is only one example of an identifier generation technique in accordance with an embodiment of the present invention, and other potential algorithms are contemplated by the present invention.

Device Identification

A method of device identification in accordance with an embodiment of the present invention is depicted in FIG. 15. First, the device is stabilized at a nominal power supply voltage (step 80), the initial states of the static storage cells in the device are read (step 81), an FID is generated from the values (step 82), and a stored CID is read from a repository (step 83). A mask of bits is selected (step 84), which may be a random selection or a selection of particular entries that will be matched between the CID and FID. However, the entire CID and FID may be matched, and the CID and FID may have any length sufficient to identify the device. (For any selection of a subset of values from the CID and FID, the entries corresponding to the same cells are selected for matching. Further, the FID entries may be extracted after the mask is selected, reducing the values that must be read from the device to extract the FID.) If the expected value entry in the FID matches the expected value entry in the corresponding position of the CID (decision 85), then the corresponding probability in the CID is added to the similarity total according to equation 14 above (step 86). If the value read from the device that forms an entry in the FID does not match the expected value for the corresponding entry in the CID (decision 85), then the corresponding probability is subtracted from the similarity total according to equation 14 above (step 87). Until the last entry position specified by the mask is compared (decision 88) steps 85-87 are repeated for the next position (step 89). After the last position is compared (decision 88), the similarity total is divided by the number of entries compared (step 90), e.g., the number of bits n in the mask selected in step 84, then the similarity total is compared to the similarity threshold (decision 91). If the similarity total meets or exceeds the threshold, a match is found (step 92) and the device is identified. If the similarity total does not meet the threshold (decision 91), then until the last CID in the repository has been compared (decision 93), the next CID is tested (step 94), repeating steps 83-91. If the similarity total does not meet the threshold (decision 91) after the last CID has been tested (step 93), then no match has been found (step 95) and the device is unidentified. The flowchart of FIG. 15 is only one example of an identification technique in accordance with an embodiment of the present invention, and other potential algorithms are contemplated by the present invention. For example, it can be advantageous to compare the FID against all CIDs in the repository, even if a match is found early on in the matching process. A stronger result is obtained when exactly one match is found across an entire set of potential identifiers, as multiple matches indicate the presence of a “false positive.”

In order to determine the length of the CID constructed and matched by an FID according to the above method, i.e., the number of cells for which data is needed after the exclusion in step 74 in order to perform step 76 in FIG. 14, to construct a sufficiently long CID, the following procedure can be used. Sample FID experiments are performed with n<n_(cells) randomly chosen memory cells. The desired fingerprint length n is then determined according to the planned number of manufactured chips as described in the section above labeled Birthday Paradox and Fingerprint Length, and the decision threshold T_(S) is determined for n such that

max(S _(mismatch))<min(S _(match))

and

T _(S)=(max(S _(mismatch))+min(S _(match)))/2.

Empirical Verification

The above-described chip identification technique depends on the stability of static storage cells within a device. For a sample of devices tested, up to approximately 20% of all cells are unstable, and the vast majority (90%) of those cells exhibit the temperature dependence described above. To obtain a temperature independent identification method, the temperature-variant cells are ignored in the similarity measure by constructing the CID to assert a probability of 0 that the temperature-variant cells will assume their expected power-up value. Hence, the similarity total is neither incremented nor decremented for those cells according to the method depicted in FIG. 15. It remains to demonstrate the claim that there exists a fingerprint length n for which the similarity measure of matching and mismatching fingerprints exhibits a similarity gap of width

S _(gap)=min(S _(match))−max(S _(mismatch))  (15)

with S_(gap)>0, such that a valid similarity threshold can be identified according to

T _(S)=(max(S _(mismatch))min(S _(match)))/2.

First, data are presented for a similarity gap over n_(cells)=4,096, in which the CID is determined for each chip based on the entire data set of power-up strings, which are the most accurate CIDs that can be produced from the data set. Rather than using data sets at two temperatures and the nominal voltage, as prescribed above, the power-up strings are obtained for all available (V_(dd),T)-conditions, and q(x_(i)) is computed as the mean value over all (V_(dd),T)-conditions. Furthermore, q(x_(i))=0 is asserted for all cells i with differing expected bits for any pair of (V_(dd),T)-conditions.

Next the power-up data is sub-sampled for fingerprint lengths n=[10, 20, 30, . . . , 1000]. For each length n 6,400 random selections of n distinct cells were performed. For each power-up bit-string, the power-up values of these n cells were extracted, interpreted as an FID, and the similarity was computed against each of the CIDs of the 32 sample die, using the same n cells. The matching similarities, in which the FID and the CID belong to the same die, are distinguished from the mismatching similarities, in which FID and the CID belong to different die, in the manner illustrated in FIG. 13, for which the actual data are depicted in the graphs of FIGS. 16A and 16B. FIG. 16A shows the similarity distributions of the normalized number of samples with a particular similarity value, for fingerprint length n=10. FIG. 16B shows the similarity distributions for fingerprint length n=140. As can be observed from the absence of any similarity gap in FIG. 16A, fingerprint length n=10 is too small for proper identification under the above-described conditions. However, for n=140 max(S_(mismatch))<min(S_(match)), which provides a similarity gap and therefore can provide proper identification of all devices. Furthermore, the distribution for n=140 shown in FIG. 16B has a smaller spread than the distribution for n=10, shown in FIG. 16A, as is expected from the central limit theorem as described above. Nevertheless, the expected values for the matching similarities, E(S_(match))≈0.85, and the mismatching similarities, E(S_(mismatch))≈00, are already visible for fingerprint lengths n=10. The relatively low expected similarity value for matching fingerprints suggests that a significant fraction of the cells with matching expected power-up values is relatively unstable.

FIG. 17A is a graph showing for fingerprint lengths n that are multiples of 10, the similarity bounds max(S_(mismatch)) (points marked “+”) and min(S_(match)) (points marked “x”). FIG. 17B is a graph showing the value of S_(gap) according to Equation 15 above. The graphs of FIG. 17A and FIG. 17B include data from all of the environmental temperature and power supply voltage conditions tested. The value of S_(gap) is negative for n<130 and positive for n≧130. Therefore, in the given example, a fingerprint length of at least n=130 bits is needed to identify the 32 die. To increase the confidence in the fingerprint method, n=200 bits and a threshold of about T_(S)=0.4 is recommended. As disclosed in the section above entitled Birthday Paradox and Fingerprint Length, the particular choice of n and T_(S) depends on the number of die to be distinguished and the variation in the fabrication process.

FIGS. 18A and 18B show the similarity bounds the corresponding similarity gap for fingerprints generated from two data sets at conditions (1V, 25° C.) and (1V, 150° C.). For die with incomplete data sets, the bit-strings at the minimum and maximum temperatures available were used. The reduction in temperature conditions required for evaluation and CID generation reduces the experimental characterization effort substantially, from performing at least 20 power-cycle experiments for each of |V_(dd)×T|=36 conditions to just 2 (V_(dd),T)-conditions tested per die. Yet, using just two data sets suffices to detect 83% of all cells with differing power-up values compared to using all of the available data sets. Comparing FIGS. 17A and 17B with FIGS. 18A and 18B, it is noted that the similarity gap becomes positive for n≧140 bits per fingerprint using a fixed power supply voltage and only two temperature conditions as opposed to n≧130 as shown in FIG. 17B. The small shift hardly effects the quality of the identification method, in part because a greater number of bits will generally be used, about n=200 bits per fingerprint, in order to increase the margin of error and to determine a safe similarity threshold T_(S).

Referring now to FIG. 19, an exemplary system that can be secured by the techniques of the present invention is shown. A processor 100 is coupled to a memory subsystem 102 and an I/O device 106. In each of memory subsystem 102 and I/O device 106, static storage cells 104 and 108 are respectively present. For example, static storage cells 104 may be a cache memory and static storage cells 108 may be a collection of latches within I/O device, or a device memory. Static storage cells 104 and 108 can be characterized according to the above-described CID generation method, and an FID extracted by processor 100 upon power-up from storage cells 104 and 108, respectively. Given a repository of CIDs, the extracted FIDs can be used by processor 100 to determine whether or not to halt execution. E.g., if one of memory subsystem 102 and/or I/O device 106 has been replaced with an unauthorized device. The CID repository may be a single CID stored within processor 100 for each subsystem being authenticated, if absolute authentication of a single device for each subsystem is needed. The system of FIG. 19 is only one example of a configuration illustrating a variety of types of storage cells and devices that may be identified by a processor in order to maintain system integrity, or make other decisions based on the particular devices identified by processor 100.

Referring now to FIG. 20, another exemplary system that can be secured by the techniques of the present invention is shown. A first VLSI device 110A is coupled to another VLSI device 110B. Each of VLSI devices 110A and 110B include different forms of static storage cells, provided for generality of illustration. In VLSI device 110A, latches forming logic circuits (not shown) are read by a scan chain interface 114 at power-up to generate an FID, and in VLSI device 110B, an internal SRAM 116 is used to generate an FID. A cryptographic processor 112A, 112B in each of VLSI devices 110A and 110B, uses the FIDs as private keys, which are combined with information stored in a corresponding non-volatile memory 118A, 118B to generate messages that are exchanged by VLSI devices 110A and 110B. The information stored in non-volatile memory 118A, 118B can be based upon a repository of CIDs or a single CID as mentioned above, so that VLSI devices 110A and 110B can each verify that they are connected to an authentic device. While the illustrated embodiment provides two-way authentication and cryptographic security for exemplary illustration, it is understood that non-cryptographic authentication and/or one-way authentication can be provided between two or more VLSI devices.

Referring now to FIG. 21, an exemplary processor 120 that can be secured by the techniques of the present invention is shown. Processor 120 includes an SRAM 122, e.g., an L1 cache that can be read at power-up to determine initial cell states. An FID generated from the initial states can be similarity matched by processor 120 against a value stored in a non-volatile program memory 124 or processor 120 may access an external database 126 (e.g., via a public or private network connection) to determine whether the FID matches one or more known CIDs. The processor illustrated in FIG. 21 can then self-authenticate to determine whether to execute, making it possible to protect otherwise obtainable program code by tying it to a specific die or group of dies.

While the invention has been particularly shown and described with reference to the preferred embodiments thereof, it will be understood by those skilled in the art that the foregoing and other changes in form, and details may be made therein without departing from the spirit and scope of the invention. 

What is claimed is:
 1. A method for authenticating an electronic device as being a particular electronic device, wherein the electronic device contains a number of static storage elements, and wherein the method comprises: applying power to the electronic device; reading initial values of the static storage elements after applying power to collect a set of power-up states; and comparing the initial values with an identifier corresponding to the particular electronic device, wherein the identifier encodes an expected value of a set of static storage elements within the particular device and probabilities that the static storage elements will assume their corresponding expected values, and wherein the comparing applies both the expected value of the set of static storage elements and probabilities that the static storage elements will assume their corresponding expected values.
 2. The method of claim 1, wherein the comparing further comprises: for the static storage elements of the particular electronic device that have an expected value that matches the initial value of the corresponding static storage elements in the electronic device, adding the corresponding probability to a total; for the static storage elements of the particular electronic device that have an expected value that does not match the initial value of the corresponding static storage elements in the electronic device, subtracting the corresponding probability from a total; and comparing the total to a threshold value to determine if a match between the initial values and the identifier is sufficient.
 3. The method of claim 2, further comprising dividing the total by a number of static storage elements included in the comparing.
 4. A computer system comprising: a processor for executing program instructions; and a memory coupled to the processor for storing the program instructions, wherein the program instructions comprise program instructions for authenticating an electronic device as being a particular electronic device, wherein the electronic device contains a number of static storage elements, wherein the program instructions comprise program instructions for, reading initial values of the static storage elements after power has been applied to the electronic device to collect a set of power-up states, and comparing the initial values with an identifier corresponding to the particular electronic device, wherein the identifier encodes an expected value of a set of static storage elements within the particular device and probabilities that the static storage elements will assume their corresponding expected values, and wherein the comparing applies both the expected value of the set of static storage elements and probabilities that the static storage elements will assume their corresponding expected values.
 5. The computer system of claim 4, wherein the program instructions for comparing further comprise program instructions for: for the static storage elements of the particular electronic device that have an expected value that matches the initial value of the corresponding static storage elements in the electronic device, adding the corresponding probability to a total; for the static storage elements of the particular electronic device that have an expected value that does not match the initial value of the corresponding static storage elements in the electronic device, subtracting the corresponding probability from a total; and comparing the total to a threshold value to determine if a match between the initial values and the identifier is sufficient.
 6. The computer system of claim 5, wherein the program instructions further comprise program instructions for dividing the total by a number of static storage elements included in the comparing.
 7. A computer program product comprising a computer-readable storage device storing program instructions for execution by a processing system, wherein the program instructions comprise program instructions for authenticating an electronic device as being a particular electronic device, wherein the electronic device contains a number of static storage elements, wherein the program instructions comprise program instructions for: reading initial values of the static storage elements after power has been applied to the electronic device to collect a set of power-up states; and comparing the initial values with an identifier corresponding to the particular electronic device, wherein the identifier encodes an expected value of a set of static storage elements within the particular device and probabilities that the static storage elements will assume their corresponding expected values, and wherein the comparing applies both the expected value of the set of static storage elements and probabilities that the static storage elements will assume their corresponding expected values.
 8. The computer program product of claim 7, wherein the program instructions for comparing further comprise program instructions for: for the static storage elements of the particular electronic device that have an expected value that matches the initial value of the corresponding static storage elements in the electronic device, adding the corresponding probability to a total; for the static storage elements of the particular electronic device that have an expected value that does not match the initial value of the corresponding static storage elements in the electronic device, subtracting the corresponding probability from a total; and comparing the total to a threshold value to determine if a match between the initial values and the identifier is sufficient.
 9. The computer program product of claim 8, further comprising program instructions for dividing the total by a number of static storage elements included in the comparing. 